AITopics | consumer device

Collaborating Authors

consumer device

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

OpenAI Just Released Its First Open-Weight Models Since GPT-2

WIREDAug-5-2025, 17:00:21 GMT

OpenAI just dropped its first open-weight models in over five years. The two language models, gpt-oss-120b and gpt-oss-20b, can run locally on consumer devices and be fine-tuned for specific purposes. For OpenAI, they represent a shift away from its recent strategy of focusing on proprietary releases, as the company moves towards a wider, and more open, group of AI models that are available for users. "We're excited to make this model, the result of billions of dollars of research, available to the world to get AI into the hands of the most people possible," said OpenAI CEO Sam Altman in an emailed statement. Both gpt-oss-120b and gpt-oss-20b are officially available to download for free on Hugging Face, a popular hosting platform for AI tools.

open-weight model, openai, openai just released, (6 more...)

WIRED

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (1.00)

Add feedback

PIPO: Pipelined Offloading for Efficient Inference on Consumer Devices

Liu, Yangyijian, Li, Jun, Li, Wu-Jun

arXiv.org Artificial IntelligenceJun-16-2025

The high memory and computation demand of large language models (LLMs) makes them challenging to be deployed on consumer devices due to limited GPU memory. Offloading can mitigate the memory constraint but often suffers from low GPU utilization, leading to low inference efficiency. In this work, we propose a novel framework, called pipelined offloading (PIPO), for efficient inference on consumer devices. PIPO designs a fine-grained offloading pipeline, complemented with optimized data transfer and computation, to achieve high concurrency and efficient scheduling for inference. Experimental results show that compared with state-of-the-art baseline, PIPO increases GPU utilization from below 40% to over 90% and achieves up to 3.1$\times$ higher throughput, running on a laptop equipped with a RTX3060 GPU of 6GB memory.

large language model, machine learning, natural language, (19 more...)

arXiv.org Artificial Intelligence

2504.03664

Genre: Research Report > New Finding (0.88)

Technology:

Information Technology > Hardware (1.00)
Information Technology > Artificial Intelligence > Natural Language > Large Language Model (0.94)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.31)

Add feedback

SpecExec: Massively Parallel Speculative Decoding For Interactive LLM Inference on Consumer Devices

Neural Information Processing SystemsMay-26-2025, 18:02:29 GMT

As large language models gain widespread adoption, running them efficiently becomes a crucial task. Recent works on LLM inference use speculative decoding to achieve extreme speedups. However, most of these works implicitly design their algorithms for high-end datacenter hardware. In this work, we ask the opposite question: how fast can we run LLMs on consumer machines? Consumer GPUs can no longer fit the largest available models and must offload them to RAM or SSD.

artificial intelligence, large language model, natural language, (6 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)

Add feedback

Evaluating Quantized Large Language Models for Code Generation on Low-Resource Language Benchmarks

Nyamsuren, Enkhbold

arXiv.org Artificial IntelligenceOct-18-2024

Democratization of AI is an important topic within the broader topic of the digital divide. This issue is relevant to LLMs, which are becoming popular as AI co-pilots but suffer from a lack of accessibility due to high computational demand. In this study, we evaluate whether quantization is a viable approach toward enabling LLMs on generic consumer devices. The study assesses the performance of five quantized code LLMs in Lua code generation tasks. To evaluate the impact of quantization, the models with 7B parameters were tested on a consumer laptop at 2-, 4-, and 8-bit integer precisions and compared to non-quantized code LLMs with 1.3, 2, and 3 billion parameters. Lua is chosen as a low-level resource language to avoid models' biases related to high-resource languages. The results suggest that the models quantized at the 4-bit integer precision offer the best trade-off between performance and model size. These models can be comfortably deployed on an average laptop without a dedicated GPU. The performance significantly drops at the 2-bit integer precision. The models at 8-bit integer precision require more inference time that does not effectively translate to better performance. The 4-bit models with 7 billion parameters also considerably outperform non-quantized models with lower parameter numbers despite having comparable model sizes with respect to storage and memory demand. While quantization indeed increases the accessibility of smaller LLMs with 7 billion parameters, these LLMs demonstrate overall low performance (less than 50\%) on high-precision and low-resource tasks such as Lua code generation. While accessibility is improved, usability is still not at the practical level comparable to foundational LLMs such as GPT-4o or Llama 3.1 405B.

benchmark, large language model, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2410.14766

Country: Europe > Ireland > Munster > County Cork > Cork (0.04)

Genre: Research Report > New Finding (1.00)

Industry: Health & Medicine (0.46)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (1.00)

Add feedback

A Hybrid Future for AI

Communications of the ACMAug-26-2024, 17:57:19 GMT

Nvidia's rise to a 2-trillion valuation at the beginning of 2024 underscored the extraordinary computing demands of artificial intelligence systems that power ChatGPT and a host of other cloud services that create videos, music, and computer programs on demand. The power of computing and memory scaling has provided much of the impetus behind the surge in interest in generative AI based on large language models (LLMs). As models get bigger they seem to harness emergent behavior, making them more useful. But, as the growth in parameter counts has easily outstripped Moore's Law, such scaling comes at a high cost. Much of the concern around resource usage has been focused on the enormous arrays of graphics processing units (GPUs) and accelerators in training grids used to train models for weeks at a time.

accuracy, engine, llm, (16 more...)

Communications of the ACM

Industry: Information Technology > Services (0.49)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.35)

Add feedback

What to expect from Microsoft Build 2024: The Surface event, Windows 11 and AI

EngadgetMay-16-2024, 18:20:10 GMT

If you can't tell by now, just about every tech company is eager to pray at the altar of AI, for better or worse. Google's recent I/O developer conference was dominated by AI features, like its seemingly life-like Project Astra assistant. Just before that, OpenAI debuted GPT 4o, a free and conversational AI model that's disturbingly flirty. Next up is Microsoft Build 2024, the company's developer conference that's kicking off next week in Seattle. Normally, Build is a fairly straightforward celebration of Microsoft's devotion to productivity, with a dash of on-stage coding to excite the developer crowd.

large language model, machine learning, natural language, (22 more...)

Engadget

Industry: Information Technology (0.35)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning > Generative AI (0.37)

Add feedback

Radair.io Launches First Consumer Device - The Radair Mini Gateway

#artificialintelligenceAug-11-2022, 08:20:20 GMT

DBA Radair.io, a provider of high-grade, multi-protocol IoT (Internet-of-Things) devices, and IoT-based enterprise solutions to drive operational efficiencies across select industries, is pleased to announce the launch of its first consumer device, the Radair Mini Gateway. The Radair Mini Gateway will support multiple ecosystems, including Helium (upon HIP19 approval) as well as The Radair Foundation's forthcoming ecosystem. The Mini Gateway will also be the industry's first light gateway with environmental monitoring that detects volatile organic compounds (VOCs), volatile sulfur compounds (VSCs), carbon monoxide, smoke, pollutants, and various gasses, with insights powered by an embedded 4-in-1 Bosch sensor. The Mini Gateway includes Wi-Fi 6E, GPS, and a barometric pressure (altitude) sensor to future-proof against changes in earning protocols, alongside the industry-leading LoRa concentrator, all elegantly built into a single device. US-based team, and an industry-leading warranty, the Radair Mini Gateway sets the new standard for IoT miners.

consumer device, radair mini gateway, recommended ai news, (4 more...)

#artificialintelligence

Genre: Press Release (0.58)

Industry: Information Technology > Smart Houses & Appliances (0.62)

Technology:

Information Technology > Internet of Things (1.00)
Information Technology > Artificial Intelligence (1.00)

Add feedback

'Smart' To 'AI' Paradigm Shift In Edge Computing

#artificialintelligenceJan-1-2022, 00:20:21 GMT

Uniquify, a Silicon Valley neural network technology and AI edge computing company, is announcing a proprietary neural network and AI modeling technology that introduces a new paradigm to transition consumer smart devices to consumer AI devices. The bottleneck to adopting advanced AI technology isn't the AI models or platforms but how to economically deploy these complex AI models for consumers at the edges. Uniquify's neural network 2.0 and AI modeling technology will enable many consumer products to become AI devices so that consumers can benefit from advanced AI models while protecting their privacy by running services at the edges. "We have seen many consumer devices like the phone, car, and TV go through a'smart' paradigm shift in the past few decades," says Josh Lee, CEO of Uniquify. "The world is ready for an'AI' paradigm shift to trigger replacement cycles in those consumer industries and more. I believe today's advanced AI models can be grafted into numerous consumer devices to provide richer experiences and enhanced capabilities for consumers. We believe we are ready to kickstart the'smart' to'AI' paradigm shift with our proprietary Neural Network 2.0 and AI modeling technology."

ai model, neural network, uniquify, (10 more...)

#artificialintelligence

Country: North America > United States > California (0.26)

Industry:

Information Technology (0.37)
Energy (0.32)
Banking & Finance > Trading (0.32)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (1.00)
Information Technology > Artificial Intelligence > Natural Language (0.56)
Information Technology > Artificial Intelligence > Representation & Reasoning (0.37)

Add feedback

What are the upcoming policies that will shape AI – and are policymakers up to the task?

#artificialintelligenceNov-20-2021, 06:53:21 GMT

As vice president and director of governance studies at the Brookings Institution, and a senior fellow at its Center for Technology Innovation, Darrell M. West spends a lot of time thinking about the intersection of policy and emerging tech. In his recent book, Turning Point: Policymaking in the Era of Artificial Intelligence, co-authored with Brookings President John R. Allen, West looks at AI use cases – "from self-driving cars to e-commerce algorithms that seem to know what you want to buy before you do" – and assesses where they're headed and how they will be shaped by policy decisions made today. The key challenge – not least in healthcare, where patient safety is paramount – is to devise regulatory guardrails that maximize the benefits of AI and machine learning and minimize their potentially dangerous downsides. In the book, West and Allen offer a series of recommendations – bolstering governmental oversight, creating new specialized advisory boards at federal agencies, third-party auditing to sniff out algorithmic bias and more. At the upcoming HIMSS Machine Learning & AI for Healthcare event, West will offer a presentation titled "The Latest Regulatory Developments Impacting Machine Learning and AI in Healthcare," where he'll explore potential new policy shifts around clinical uses of artificial intelligence: algorithmic bias, remote patient monitoring, patient safety, fitness trackers and more.

machine learning, policymaker, upcoming policy, (13 more...)

#artificialintelligence

Industry: Health & Medicine > Diagnostic Medicine (0.51)

Technology:

Information Technology > Artificial Intelligence > Machine Learning (0.69)
Information Technology > Artificial Intelligence > Robots > Autonomous Vehicles (0.56)
Information Technology > Artificial Intelligence > Applied AI (0.56)

Add feedback

Artificial Intelligence: What is it in reality ?

#artificialintelligenceJul-15-2021, 08:45:23 GMT

Artificial intelligence (AI) is the simulation of human actions and intelligence by computers. It is a combination of many technologies such as Machine Learning, Natural Language Processing and Applied Intelligence. Reactive machines don't have the ability to learn and adapt, hence they are not used for memory based scenarios and can be used for automatic responses to a limited set of inputs. Limited memory machines are capable of learning from historical data and make decisions. They use deep learning techniques for training and storing memory for these machines.

artificial intelligence, intelligence, use case, (9 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.80)

Add feedback